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It is well known that the Euclidean algorithm or its equivalent , continued fractions, 
can be used to find the error locator polynomial needed to decode a Reed-Solomon (RS) 
code. It is shown in this article that this algorithm can be used for both time and trans- 
form domain decoding by replacing its initial conditions with the Forney syndromes and 
the erasure locator polynomial. By this means both the errata locator polynomial and the 
errata evaluator polynomial can be obtained with the Euclidean algorithm. 

With these ideas, both time and transform domain Reed-Solomon decoders for cor- 
recting errors and erasures are simplified and compared in this article. Asa consequence , 
the architectures of Reed-Solomon decoders for correcting both errors and erasures can 
be made more modular , regular , simple, and naturally suitable for VLSI implementation. 


I. Introduction 

The Euclidean algorithm for solving the key equation for 
decoding Bose-Chaudhuri-Hocquenghem (BCH) and Goppa 
type codes was first developed by Sugiyama et al [1]. The 
authors [2] , [3] derived a fast decoding of Reed-Solomon 
(RS) codes using the continued fraction, which is closely 
related to the Euclidean algorithm. Brent and Kung [4] were 
the first to suggest a systolic array architecture for computing 
the greatest common divisor (GCD) of two polynomials. 
Through the use of these ideas, a pipeline structure for a trans- 
form domain decoder was developed to decode errors of RS 
codes [5] . An important ingredient of this design is a modi- 


fied Euclidean algorithm for computing the error locator 
polynomial. 

The computation of inverse field elements is completely 
avoided in the above-mentioned modification of Euclid’s 
algorithm. Recently, the authors [6] proposed that a recursive 
algorithm could be used to perform this modified Euclidean 
algorithm. An important advantage of this new recursive algo- 
rithm is that the entire systolic array needed to perform 
Euclid’s algorithm requires substantially less silicon area than 
the pipeline version of the modified Euclidean algorithm given 
in [5], 
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Forney [13] defined an errata locator polynomial using 
what are now called Forney syndromes to correct both errors 
and erasures. Blahut [7] showed that the errata locator poly- 
nomial can be computed directly by initializing Berlekamp’s 
algorithm with the erasure locator polynomial. 

Recently Eastman [8] suggested that the errata evaluator 
polynomial can be computed directly by initializing Berle- 
kamp’s algorithm with the Forney syndrome polynomial. This 
new, simplified decoding procedure is proved in [9] . By this 
technique, it is possible to compute the errata locator polyno- 
mial and the errata evaluator polynomial simultaneously from 
the Euclidean algorithm. This new RS decoder uses both the 
erasure locator polynomial and the Forney syndrome polyno- 
mial as initial conditions for the Euclidean algorithm. 

It is shown and proved in [9] that the modified Euclidean 
algorithm mentioned above can be used to solve the Berle- 
kamp-Massey key equation for the errata locator polynomial 
and the errata evaluator polynomial directly and simultane- 
ously. By this means a new, simplified pipeline architecture 
for both the time and transform domain decoders can be 
developed for correcting both errors and erasures of RS codes. 
Such a decoding technique can be faster and simpler than 
previous methods [15] , [10] . 

In this article, it is found that the VLSI implementation of 
the transform domain decoder is simpler than that of the time 
domain decoder. However, for a long RS code (10 bits or 
larger), due to the large size of the inverse transform unit 
needed in the transform decoder, the VLSI area needed to 
implement the transform domain decoder can be substantially 
larger than that needed for the time domain decoder. For 
moderately long codes, such as the 8-bit (255,223) RS code 
used in the concatenated coding system for NASA’s Voyager 
mission [11], the transform domain decoder is still simpler 
than the time domain decoder. 

The above-mentioned NASA coding system is called the 
“baseline” system. It uses a (7, 1/2) convolutional code as 
its inner code and an 8-bit (255,223) RS code as its outer 
code. It is shown [12] that this system achieves a bit -error 
rate (BER) of 10"" 6 at a bit signal-to-noise ratio (SNR) of 
2.53 dB. 

As mentioned above, the time domain decoder is more 
efficient in area than the transform domain decoder for very 
long RS codes. One such example is the long, 10 bits/symbol 
(1023,959) RS code presently being considered for very deep 
space probes. If this code is concatenated with a (15,1/5) 
convolutional code, it achieves a BER of 10” 6 at an SNR of 
0.5 dB [12]. Evidently the new NASA concatenated coding 
system provides a 2 dB improvement over the present baseline 


system. It is for this reason and many other applications that 
it is important to develop an efficient, VLSI implementable, 
time domain RS decoder. 

II. The Time Domain Decoder for RS Codes 

An algorithm is developed in [15] for time domain decod- 
ing of RS codes to correct both errors and erasures through 
the use of continued fractions or their equivalent, Euclid’s 
algorithm. This algorithm is a modification of the Berlekamp- 
Forney method [13] , [14] . In this algorithm, the continued 
fraction algorithm is used to find the error locator polyno- 
mial from the remainder of the formal power series for the 
Forney syndrome. The disadvantage of this algorithm is that 
after the error locator polynomial is obtained by continued 
fractions, two polynomial multiplications are needed to com- 
pute the errata locator polynomial and the errata evaluator 
polynomial from the known error locator polynomial. 

In this section, the above-mentioned algorithm is modified 
to correct both errors and erasures in the time domain decod- 
ing of RS codes by the use of the Euclidean algorithm. In this 
new algorithm, the Euclidean algorithm is used to solve the 
Berlekamp-Forney key equation for the errata locator polyno- 
mial and the errata evaluator polynomial directly and simulta- 
neously. The advantage of this algorithm over previous methods 
[15] is that separate computation of the errata locator poly- 
nomial and the errata evaluator polynomial, which is usually 
needed [15], can be avoided. This new decoding algorithm 
is highly suitable for both VLSI and software implementation. 

First, let GF( 2 m ) be a finite field of 2 m elements. Also, let 
N = 2 m - 1 be the length of the ( N,I ) RS code over GF(2 m ) 
with minimum distance d, where I-N-(d - 1) denotes the 
number of m - bit message symbols and d - 1 denotes the num- 
ber of parity symbols such that d - 1 is either an even or an 
odd integer. The following five vectors are defined as: 

c = (c 0 ,c x , . . . ,%_!), code vector 
r = (r 0 ,r 1 ,..., received vector 

e = O 0 > e i> • • • error vector 

u = (u 0 ,u j, . . . erasure vector 

u = (u 0 ,u x , ... , w A r_ 1 ), errata vector 

These vectors are related by u = e + u and r = c + u + e. 

Suppose that t errors and v erasures occur in the received 
vector r, and assume that v + 2t < d - 1 . Next let a be a primi- 
tive element in GF(2 m ). Then 7 = a i is also a primitive element 
in GF(2 m ), where (/, N) = 1 . 
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To minimize the complexity of an RS encoder it is desir- 
able that the generator polynomial be symmetric. If 7 is a root 
of the code’s generator polynomial, it is shown [16] that the 
generator polynomial £(x) is symmetric if and only if 

b+(d- 2 ) d-\ 

g{x) = fl ix-y‘) = E g i x ‘ (1) 

i=b i=0 

where g 0 = g d _ 1 = 1 and b satisfies the equality 2 b + d - 2 = 
2 m - 1 . The syndromes of the code are given by 

V,M - E v'"- 1 ** 1 ■ £ 

i=0 i=0 

v + 1 

= 2 y.X (5 ^ )+fc for 1 < k < d - 1 ( 2 ) 

7-1 


The error locator: 

xcx) = n o - = e ( i = e wv' 

x^.ex ;=i /=o 

(4b) 

where X 0 = 1 • 

The errata locator : 

v+r v+r 

r(x) = A(x)X(x) = n 0 -X,*) = E ^V' 

7=1 7=0 

(4c) 

where t 0 = 1 . 

The errata evaluator: 


where A) is either the ;th erasure or error location, and Yj is 
either the ;th erasure or the error magnitude. Define the set 
A = {Xj \X ( is an erasure location} and X = {X t \X ( is an error 
location}. Define the syndrome polynomial 


«*> - £ W« <3a) 

k = 1 

Then it is not difficult to show (see [14] ) that 


v+r 

A (x) = £ Y.Xf |fj (1 -Xx)j (4d) 


In terms of the polynomials defined above, Eq. (3b) becomes 


S(x)t(x) = AQc) + x*~' 'jT'Yrf**-' (fl (1 

7=1 \ l+i 



d - 1 


Six) = E 5 ( 


,k- 


fc=l 


(*-!)+* ‘ 


v+r Y.tf! 

t _ v 

~ * V (l -*.*) 

7=1 1 


From Eq. (5), one obtains the congruence relation, 

S(x)t(x) = A (x) mod (6a) 


v+r 

E u 

(1-Xjx) 

7=1 7 


(3b) 


Following [14], we define four different polynomials as 
follows: 


The erasure locator : 


A(*)« fl (l - X.x) = 2 0 X f x) = 

XfA j=l 


v 


E (-D / A / 


(4a) 


It is shown [9] that Eq. ( 6 a) can be solved to yield 

sw * x^) modJ^ ''' , (6b) 

It is well known, e.g., see [15] , that the maximum num- 
ber of errors in an RS code which can be corrected is 
\Jd - 1 - v)/ 2 J where [xJ denotes the greatest integer less 
than or equal to x, i.e., the principal part ofx. We now define 
the Forney syndrome polynomial. 

Definition 1 : The Forney syndrome polynomial is defined 
by 


where A 0 = 1 . 


T(x) = S(x)A(x) mod x d ~ x (7) 
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By Eq. (7), the key in Eq. (6b) for X(x) and A (x) is: 


( 8 ) 


where both the inequalities, deg (r(x)} < [_(c? + v ~ 1 )/2 J and 
deg {A (x)} < [{d + v - 3)/2J , are satisfied. 


T(x) = 4rrmodx d - 1 
X(x) 

where 


Proof: Theorem 1 is a proof [9] that the idea in [8] is 
correct. 


deg {\(x)} < \£d - 1 - v)/2J 


and 


deg {A(x)} < [_{d + v - 3)/2J 

It is shown in the following theorem that the errata evalu- 
ator polynomial A(x) and the errata locator polynomial r(x) 
can be obtained simultaneously and simply from knowing 
T(x ) in Eq. (7) and the new key equation in Eq. (8), which 
takes into account both errors and erasures. 

Theorem 1 : Let T(x) in Eq. (7) be the Forney syndrome 
polynomial of a /-error and v-erasure correcting RS code under 
the condition v + 2/<g?- 1 where d - 1 is either an even or 
an odd integer. Consider the two polynomials M(x) = x d ~ l 
and T(x) = SO*) A(x) mod x d ~ l . Then the Euclidean algo- 
rithm for polynomials on GF(2 m ) can be used to develop two 
finite sequences R s (x ) and t s (x) from the following two recur- 
sive formulas: 

r * w = K-i (*)) Vi (*) + t ,-2 (*) ( 9a ) 

and 

R s (x) = R s _ 2 (*) - Q s _ 1 (x) R s l (x) (9b) 

for (s = 1,2,...), where the initial conditions are r 0 (x) = 
A(x), r_j (x) = 0, tf_,(x) = Mix'), and R 0 (x) = T{x). Here 
Q s _ 1 (x) is obtained as the principal part of R s _ 2 (x)/R s _ 1 (x). 
The recursion in Eq. (9) for R s (x) and r s (x) terminates when 
deg {/?j(x)} < v + w- 1 for the first time for some value s = s'. 
Let 

R s (x) 

A(x) = (10a) 

and 

vW 

r(x) = — (10b) 

Also in Eq. (10), A = 7y(0) is a field element in GF(2 m ) 
which is chosen so that r 0 = 1 . Then A (x) and r (x) in Eq. (10) 
are the unique solutions of 

A(x) = T(x) t(x) mod x d ~ l (10c) 


The roots of r(x) are the inverse locations of the / errors 
and v erasures. These roots are most efficiently found by the 
Chien search procedure. By Eq. (4d) it is readily shown that 
the errata values are 



for 1 < A: < y + 7 (11) 


where r'(X k l ) is the derivative with respect to x of r(x), 
evaluated at jc = X k l . 

The overall time domain decoding of RS codes for correct- 
ing errors and erasures using the Euclidean algorithm is sum- 
marized in the following steps: 

(1) Compute the transform of the received m-tuple vector 
over GF( 2 m ) from Eq. (2). Next, calculate the erasure 
locator polynomial A(*) from Eq. (4a) and define 
deg { A(x)} = v . 

(2) Compute the Forney syndrome polynomial from T(x) 
in Eq. (7). 

(3) Determine the errata locator polynomial t(jc) and 
errata evaluator polynomial A( jc), where 0 < v < 
d - 1, by applying the Euclidean algorithm to x d ~ l 
and T(x) as given by Eq. (7). The initial values of the 
Euclidean algorithm are r 0 (x) = A(x), r_ x (jc) = 0, 

(*) -x d ~ l , and R 0 (x)-T(x). The recursion in 
Eq. (9) for R s (x ) and r^(x) terminates when deg 
{R 5 (x)} < L(^ + v - 3)/2 J for the first time for some 
value s = s. Finally, compute r(x) and A(x) from 
Eq. (10). For v = d - 1, set r(x) = A(x) and /4(x) = 
T(x). 

(4) Compute the errata values from Eq. (11). 

To illustrate the time domain decoding procedure for 
correcting errors and erasures, an elementary example of an 
RS code over GF( 2 4 ) is now presented. The representation of 
the field GF( 2 4 ) generated by the primitive irreducible poly- 
nomial #(x) = x 4 + x + 1 is given in Appendix A. 

Example 1 : Consider a (15, 9) RS code over GF( 2 4 ) with 
minimum distance d = 7. In this code, v erasures and / errors 
under the condition 2/ + v < d - 1 can be corrected. In order 
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to simplify this example, let 7 = a and b = 1 . Thus, the genera- 
tor polynomial of such a (15, 9) RS code is defined by 

6 

g(x) = (x - a 1 ) = x 6 + a 10 x 5 + a 14 x 4 
/= l 

+ a 4 x 3 + a 6 x 2 + a 9 x + a 6 


The syndromes S k for r are 
14 

s k = ^2 r n ank = a 7 (a 3 )* + a 2 (a 7 )* + a H (<* 10 )* 

«= o 

for 1 < k < 6 


Assume the message symbols are 
/(x) = a 10 x 14 + a 12 x 13 +a 8 x 12 +a 5 x u + a 6 x 10 
+ a 14 x 9 + a 13 x 8 + a n x 7 +a 9 x 6 

The encoded code word, which is a multiple of £(x), is 

c(x) = a 10 x 14 + a 12 x 13 + a 8 x 12 +a 5 x u + a 6 x 10 
+ a 14 x 9 + a 13 x 8 + a n x 7 + a 9 x 6 +x 5 


This yields S { = a 0 , S 2 = a 13 , S 3 = a 14 , S A = a 11 , S 5 = a, 
and S 6 = 0. Thus, the syndrome polynomial is 5(x) = a 0 J 
a l3 x + a 14 x 2 + a 1J x 3 + ax 4 + Ox 5 . 

The erasure locator polynomial is A(x) = (1 + a 7 x). In this 
example, the maximum erasure correcting capability is 

LCrf-1 -v)/2J = L(7 - 1 - 1)/2J =2 
By Eq. (7), one obtains the Forney syndrome polynomial as 


+ ax 4 + <* 2 x 3 +a 6 x 2 + a 12 x + a 8 


jf(x) = A(x)S(x) = (1 + a 7 x) (1 + a 13 x + a 14 x 2 + a n x 3 


Written as a vector, the code word is 


= (a 10 , a 12 , a 8 , a 5 , a 6 , a 14 , a 13 , a 11 , a 9 , a 0 , a, 


a 2 , a 6 , a 12 , a 8 ) 


Assume the erasure vector is 


u = (0, 0, 0, 0, 0, 0, 0, oc 2 , 0, 0, 0, 0, 0, 0, 0) (12) 

and the error vector is 

e = (0,0, 0,0, a 11 , 0,0, 0,0, 0,0, a 7 , 0,0,0) 

(13) 


Then the errata vector is 

u = u + e - (0, 0, 0, 0, a 11 , 0, 0, a 2 , 0, 0, 0, a 7 , 0, 0, 0) 

(14) 


+ ax 4 + Ox 5 ) mod x 6 
= (0x 6 + a 8 x 5 + a 9 x 4 +ax 3 + a 12 x 2 
+ a 5 x + a°)modx 6 

= a 8 x 5 +a 9 x 4 + ax 3 +a 12 x 2 +a 5 x +a° 

(16) 

In Eq. (16), the coefficients of T(x), T 0 = a 0 , T t = a 5 , T 2 - 
ot 12 , T 3 = a, F 4 = a 9 , and T s = a 8 are the Forney syndromes. 

The Euclidean algorithm is applied next to polynomial 
x**- 1 and 7Xx) in Eq. (16). By this means, polynomials r(x) 
and A (x) are determined next by use of the Euclidean algo- 
rithm. This is accomplished by the recursive Eqs. (9a) and 
(9b) illustrated in Table 1, where initially R_ A (x) = x^" 1 = 
x 6 and R 0 (x) = a 8 x 5 + a 9 x 4 + ax 3 + a 12 x 2 + a 5 x + 1. 
From Table 1, one observes that deg (i^(x)} = deg ( J R 2 (x)}= 
2 < [_(d + v - 3)/2 J = 2. Thus, the computation terminates 
at s' = 2, and 


Assume the received vector is 

r = e + u = (a 10 , a 12 , a 8 , a 5 , a, a 14 , a 13 ,a 9 , 


a 9 , a 0 , a, a 12 , a 6 , a 12 , a 8 ) (15) 


and 


R 2 (x) = a 1 x 2 + ax +a 2 (17a) 


r 2 (x) = a 7 x 3 + a 13 x 2 + a 4 x + a 2 (17b) 
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By Eqs. (10a) and (10b), one has 


Summing Eq. (21) over i for 1 < i < v + t produces 


t(x) =— r 2 (x ) = a s x 3 + q u x 2 + a 2 x + 1 (18) 

a 2 

and 

A(x) = — R 2 (x) = q s x 2 +q 14 x + l (19) 

a 2 

By using Chien’s search, the roots of t(x) constitute the set 
(a -7 , a -3 , a -10 }. The derivative with respect to x of t(x) in 
Eq. (18) is t'(x) = a 5 x 2 + a 2 . Thus, the errata values are 

y _ _ i4(a~ 7 ) _ q 5 (q~ 7 ) 2 + q 14 (q~ 7 ) + 1 _ 

1 /(AT -1 ) T'(q -7 ) q s (q _7 ) 2 +q 2 

Y _ AX j l _ q 5 (q~ 3 ) + q 14 (q~ 3 )+ 1 _ a i 
2 t\X~ x ) q 5 (q -3 ) 2 + q 2 

and 

a 5 (q~ 10 ) 2 +q 14 (q~ 10 )+ 1 _ 

73 t'(X~ 1 ) q s (q -10 ) 2 + q 2 


v+t v+t 

J2 Y tf b ~ l)+k - t iT, y rf b ~ 1)+k ~ l 

<= 1 1=1 

v+t 

+ . . . + (-i) v+f r v+f £ = 0 

/=1 

( 22 ) 

From Eq. (22), one has 

^(6-1)+*: “ T l‘^(6-l)+*'-1 

(23) 

Hence, in general, 

F - T F 

n {b^\)+k 

+ . . . + (-i)' +v f) = 0 

ioxk>d (24) 


III. The Transform Decoder for RS Codes 

The transform decoder of Gore and Mandelbaum [17], 
[18] was developed further in [10] to correct both errors and 
erasures. This decoding procedure was based on the algorithm 
originally invented by Forney [13] (also see [10]). By the 
above-mentioned Euclidean algorithm, the transform domain 
decoding procedure in [10] can be simplified further. 

By the same procedure used in the time domain decoder, 
one can obtain the errata locator polynomial given in Eq. (4c). 
Hence, 

T{X?) = 1 + (—l)^", (xr l ) + (~i) 2 t 2 (xr 1 ) 2 
+ ... + (-i) v+f V f (^r 1 ) v+f = 0 

for 1 < i < v + t (20) 

Multiplying Eq.(20)by Y i xf b ~ 1 '* +k yields 

+ +(-l) v+l r Y.X< b ~ D+*-(v+0 = 0 

T ‘ ’ ’ V J V+t l l 

( 21 ) 


are the recursive equations for Ej , the transforms of the errata 
pattern, where initially E b = S b ,E b+l -S b+1 >... >E b+d _ 2 are 
known from the prior syndrome calculation. 

From Eq. (24), one obtains the rest of the transform of u, 
i.e., the S Q for 0 < C < N - 1*. The amplitude u vector is found 
by taking the inverse transform over GF(2 m ) of S Q , 0 < C < 
N - 1 . Finally, the original m-tuple code vector can be obtained 
by subtracting u from the received vector r . 

Let us now recapitulate the above transform decoding algo- 
rithm of RS codes for correcting both errors and erasures, 
using transforms over GF( 2 m ) and the Euclidean algorithm. 
This procedure is composed of the following five steps: 

(1) Use step 1 in the time domain decoder. 

(2) Use step 2 in the time domain decoder. 

(3) Use step 3 in the time domain decoder. 

(4) Compute the rest of the transform of the errata vector 
by the use of Eq. (24). 

(5) Invert the transform to recover the errata vector using 
the fact that S 0 = S N . Then obtain the corrected code 
vector. 
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To illustrate the transform domain decoder for correcting 
errors with erasures, the data for the (15,9) RS code over 
GF{ 2 4 ) used in Example 1 is again used. 

Example 2: Consider the (15,9) RS code over GF( 2 4 ) 
with d-1. For this code, the erasure, error, errata, and 
received vectors are given by Eqs. (12), (13), (14), and (15), 
respectively. By Eq. (18), the errata locator polynomial is 

r(x) = r 0 +T \ X +r 2 * 2 +r 3 * 3 = 1 + + Q U x 2 + a 5 x 3 

where r 0 = 1, t 1 = a 2 , r 2 = a 11 , and r 3 = a 5 . 

By Eq. (23), the rest of the transform of the errata vector is 

(25) 

That is, S 7 = a 13 , S s = a 13 , S 9 = a 7 , S l0 = a 3 , S n = a 5 , 
5 a2 = a 13 9 $13 = a 5 > * 5 > i 4 = a 5 > *^0 = 1 • 

The inverse transform of S k is 

15-1 

U k = Yj S n a ~ nk for 0 < £ < 14 

n — 0 

The result is u = (0, 0, 0,0, a 1 1 ,0,0, a 2 , 0,0,0, a 7 , 0,0,0). 
The corrected code is thus 


functional units in this part are (1) the syndrome computa- 
tion unit; (2) the power calculation unit; (3) the power expan- 
sion unit; (4) the polynomial expansion unit; and (5) the 
L(d + v - 3)/2J generator. Also included in this part are some 
delay registers. The lengths of the delay registers may not be 
equal in these two decoder architectures, but since they con- 
tain only replicated register cells, they can be considered 
identical in architecture. 

Figure 3 shows the block diagram of the syndrome compu- 
tation unit. This unit accepts the received messages and com- 
putes their syndromes. There are 32 syndrome subcells in a 
(255,223) RS decoder. Each subcell depicted in Fig. 3 per- 
forms the operation as S t «- (S ( + where denotes 

the operation “is replaced by.” The Berlekamp multiplier is 
used in this syndrome unit due to its simplicity in VLSI 
design [19]. The computed syndrome polynomial is labeled 
as S(x ) in both Figs. 1 and 2. In the time domain and trans- 
form domain decoders, the coefficients of S(x) are fed in 
parallel to the polynomial expansion unit to compute the 
Forney syndromes. 

The power calculation unit converts the received 1 ’s and 0’s 
into a sequence of ot k ’s and 0’s, where a is a primitive element 
of the finite field over which the RS code is defined. These 
received l’s and 0’s indicate the occurrence or nonoccurrence, 
respectively, of an erasure at a specific location. Figure 4 
shows the block diagram of the power calculation unit. Since 
the maximum erasure correcting capability of a (255, 223) RS 
decoder is 32, only 32 symbol latches are needed to store the 
locations of all the correctable erasures. 


c = 


r - u = (a 10 , a 


12 a 8 

> 3 


5 14 

a, a, a , 


a 13 , a 9 . 


, a 0 , a. 


a 12 , a 6 , a 12 , a 8 ) 

-(0,0, 0.0. a 11 , 0,0, a 2 , 0,0,0, a 7 , 0,0,0) 


A detection circuit for detecting the occurrence of erasures 
is included in the power calculation unit. If an erasure occurs 
at the fcth location, its corresponding symbol a k is calculated 
and latched. This a k 's sequence is fed to the polynomial 
expansion circuit, to the power expansion unit, and to the 
\id + v - 3)/2 J generator. 


(a 10 , a 12 , a 8 , a 5 , a 6 , a 14 , a 13 , a 11 , a 9 , a 0 , a, 
a 2 , a 6 , a 12 , a 8 ) 


IV. A Comparison of VLSI Architecture of 
the Transform Decoder and the Time 
Domain Decoder 

The block diagram of a (255, 223) RS time domain decoder 
is depicted in Fig. 1. Figure 2 shows the block diagram of a 
(255, 223) RS transform domain decoder. Each block diagram 
can be separated into two parts, indicated by broken lines, as 
shown in both Figs. 1 and 2. The first part, labeled as “I” in 
both block diagrams, has similar VLSI architecture. The major 


The power expansion unit converts the a k ’s sequence into 
an erasure locator polynomial A(x) which has a k 's as its 
roots. Figure 5 depicts the block diagram of this unit. The 
erasure locator polynomial A(x) is fed to the modified GCD 
unit as one of the initial conditions. 

A generator is used to compute |_(^ + v - 3)/2J . This is 
shown in both Figs. 1 and 2. The output of this generator is 
sent to the modified GCD unit and used as a stop indicator for 
Euclid’s algorithm 

Figure 6 presents a block diagram of the polynomial expan- 
sion circuit. The Forney syndromes for either the time domain 
decoder or the transform decoder are calculated in this unit. 
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Figure 7 depicts the block diagram of the modified GCD 
unit. As described previously in [6] , a multiplexing scheme 
can be applied to the modified GCD unit to reduce the num- 
ber of cells needed. The polynomial A(x) together with the 
Forney syndrome polynomial r(x) are the two inputs to the 
modified GCD unit. The output of the modified GCD unit is 
the errata locator polynomial r(x) and the errata evaluator 
polynomial A (jc). The error correcting capability of the code 
is computed by [_(32 - v)/2J . 

The differing functional units of the time and transform 
domain decoders are shown in the second half of Figs. 1 and 2 
and are labeled as “II.” One output of the modified GCD unit 
of the time domain decoder, the errata locator polynomial 
r( x), is fed to a Chien search unit and to another unit for com- 
puting [j < b - l r'(x)] = [x 111 t ; (x)] _1 , where 6 = 112 in this 

design. The other output of the modified GCD unit of the time 
domain decoder, the errata evaluator polynomial A(x), is fed 
to the polynomial evaluation unit to perform the evaluation of 
v4 (jc)* Figure 8 shows the block diagram of the polynomial 
evaluation unit. 

The [x 111 unit performs the calculation of one part 

of the errata magnitude [6] . Figure 9 depicts the block dia- 
gram of this unit. The product of the outputs of the polyno- 
mial evaluation unit and the [x in /(x)] unit forms the 
errata magnitude. 

In the time domain decoder, the Chien search unit is used 
to search for the error and erasure locations; for more details, 
see [6] . The architecture of the Chien search unit is similar to 
that of the polynomial evaluation unit, except there is a zero 
detector at the end. 

On the other hand, for the transform domain decoder 
design, the output from the modified GCD unit is the errata 
locator polynomial r(x). This output is fed to the transform 
error pattern unit, along with the syndromes from the syn- 
drome computation unit, to calculate the extended syndromes. 
A new architecture for the transform of the error -pattern unit 
is developed in Appendix A. The realization of this idea is 


shown in the block diagram of the transform of the error- 
pattern unit, given in Fig. 10. 

The computation of extended syndromes, together with the 
original syndromes, is sent to the inverse transform unit to 
obtain the estimated error patterns. Figure 11 shows the block 
diagram of the inverse transform error-pattern unit. It is easy 
to see that the architecture for the inverse transform unit is 
similar to that of the syndrome computation unit except that 
255 subcells are needed in the inverse transform unit while the 
syndrome computation unit needs 32 subcells. 

Clearly, the architecture of the transform domain decoder 
lesign is simpler than that of the time domain decoder design. 
This is because the transform domain decoder design needs 
only two regular function blocks in part II of Fig. 2. However, 
the time domain decoder requires three function blocks for 
the implementation in part II of Fig. 1 . 

Furthermore, the inverse-transform unit in the transform 
domain design contains 255 similar cells in the (255, 223) RS 
decoder. It is estimated that these 255 cells occupy only a 
moderate amount of silicon area, and that their geometric 
arrangement can be regular and simple. Therefore, substantial 
time for the design and test of such a VLSI chip can be saved. 
However, the advantage of the transform domain decoder is 
valid only for moderately short length RS codes. If long length 
RS codes are used to enhance the system’s performance [12] , 
the transform domain decoder needs a large inverse transform 
block. This might cause a problem in the VLSI implementa- 
tion. In general, if a GF(2 m ) field is used to define an RS 
code, an inverse transform block composed of 2 m - 1 cells is 
needed. Hence, the number of cells needed in an inverse trans- 
form block increases exponentially with the integer m. How- 
ever, the number of transistors needed in the time domain 
decoder goes up only linearly as the integer m increases. 
Therefore, for long length codes, the time domain decoder 
is the more appealing approach. Although the computation 
of the time domain decoder is more complex than that of the 
transform domain decoder, for long RS codes the number of 
transistors needed in a time domain decoder is substantially 
less than that in a transform domain decoder. 
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Appendix A 

New Architecture for the Transform of the Error Pattern Unit 


In this appendix, a VLSI architecture is developed to com- 
pute the transform of the error pattern. Recently, Johnson 
et at [20] proposed a systolic array for computing a linear 
convolution. Using a technique similar to that suggested by 
Johnson et al . , the transform of the error pattern in Eq. (23) 
or Eq. (24) can be implemented in a systolic array. The advan- 
tage of this method over the previous method in Fig. 15 of [5] 
is that the long delay needed in the large XOR tree used for 
summing all the terms in Eq. (23) is eliminated. Also, the zero 
detectors needed in the previous design [5] are not required in 
this new architecture. 

To illustrate this new architecture, the data in Example 2 
for a (15,9) RS code are used here as an example. The recur- 
sive equation to compute the remainder of the transform of 
the error pattern is given in Eq. (25). The new design for 
computing Eq. (25) is shown in Fig. A-l. In this figure, the 
function of each cell can be described by a register transfer 


relation of the type «- R i+l + S k a* . The input data are 
sent to all the cells simultaneously. 

To understand the operation of this circuit, assume initially 
that all registers R f for 1 < / < 3 are set to zero. The control 
signal C is high for 6 symbol clocks to allow data S x , S 2 , . . . , 
S 6 to be fed into the circuit. The input data are also sent to 
the output node. At the same time, the complement signal C 
of signal C is low to prevent the data stored in register R t 
from being sent to the output node. Note that one “clock 
time” for one Galois field symbol equals 4 circuit clock times. 
At the seventh symbol clock time the control signal C is 
switched to low or zero so that C = 1. Therefore, the data 
stored in register R v which equals S 7 at that moment, is sent 
to the output node and fed back to all basic cells. This pro- 
cess continues until the rest of the transform of the error 
pattern, i.e., 5 7 , S 8 , . . . ,5' 1S , is obtained. The detailed opera- 
tion of this circuit is described in Table A-2. 
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Table A-1. Representations of the elements of GF( 2 4 ) generated by a 4 + a + 1 = 0 
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Table A-2. The fifteen steps of the transform of the error pattern algorithm 
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